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Simulation  of  Systolic  Networks  with  a  Syntax  Directed  Solver  for 
Systems  of  Sequence  Equations 

Rami  G.  Mai  ham 


ABSTRACT 

\ 

The  main  objective  of  this  paper  is  to  supplement  a  model 
that  was  previously  suggested  for  the  verification  of  systolic 
networks.  A  simple  language  Is  presented  to  express  the  sys¬ 
tem  of  sequence  equations  that  models  the  operation  of  the 
network.  Then  a  syntax  directed  interpreter  Is  developed  to 
solve  this  system  for  specific  forms  of  the  inputs  and  to  pro¬ 
duce  the  corresponding  outputs.  This  technique  for  the  verifi¬ 
cation  of  a  certain  computation  on  a  systolic  network  is 
equivalent  with  the  simulation  of  its  execution. 
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1.  Introduction 


An  abstract  mathematical  model  for  the  specification  and  verification  of  sys¬ 
tolic  networks  (1]  was  introduced  in  [21  and  extended  in  [31.  The  basic  idea  of 
this  model  is  the  representation  of  the  data  appearing  6n  any  one  communication 
link  of  the  systolic  network  by  a  data  sequence.  Here  the  tth  element  of  the 
sequence  associated  with  some  link  is  the  data  item  that  appeared  on  that  link  at 
time  t.  where  as  usual,  one  time  unit  is  the  period  of  the  clock  that  synchronizes 
the  network. 

The  computation  performed  by  each  computational  ceil  In  the  network  is 
then  modeled  using  causal  operators  on  data  sequences.  More  specifically,  the 
sequences  on  the  output  links  of  a  certain  cell  are  described  as  a  function  of  the 
sequences  on  the  input  links  to  that  cell,  with  the  condition  that  the  element 
of  any  output  sequence  cannot  depend  on  some  (r+i)  element  in  any  Input 
sequence  for  i? 0.  in  other  words,  the  output  cannot  depend  on  future  inputs. 
This  is  called  the  causality  condition. 

in  order  to  model  the  computation  of  a  systolic  network,  we  establish  tor 
each  node  in  the  network  the  sequence  equations  describing  its  operation.  The 
solution  of  the  resulting  system  of  equations  is  called  the  network  I/O  description 
and  gives  the  output  sequences  of  the  network  in  terms  of  its  input  sequences, 
in  the  remainder  of  this  report,  we  will  use  the  term  sequence  equation  in  a  res¬ 
trictive  manner  to  indicate  an  equation  in  which  the  left  side  is  a  sequence  iden¬ 
tifier  and  the  right  side  a  sequence  expression.  This  Is  the  only  type  of  equa¬ 
tions  that  is  needed  for  modeling  the  operation  of  systolic  networks. 


It  is  clear  that  the  I/O  description  of  a  systolic  network  specifies  the  global 
behavior  of  the  network  by  providing  a  means  of  obtaining  the  network  output  for 
any  given  inputs.  However,  we  do  not  need  to  find  explicitly  the  I/O  description 
of  a  certain  network  in  order  to  verify  its  operation  for  specific  inputs.  In  fact,  it 
suffices  to  substitute  the  known  Input  sequences  into  the  system  of  equations 
modeling  the  network,  and  then,  by  sequence  manipulation,  determine  the  descrip-* 
tion  of  the  output  sequences. 

Unfortunately,  our  tools  for  manipulating  sequences  are  still  limited  and  in 
many  cases,  we  are  not  able  to  obtain  analytically  the  description  of  the  output 
sequences,  in  order  to  alleviate  this  problem,  we  describe  in  this  report  a  com¬ 
puter  system  that  was  developed  to  solve  Iteratively  any  system  of  consistent 
causal  sequence  equations  for  specifically  given  input  sequences.  In  this  context, 
an  input  sequence  is  simply  defined  as  a  sequence  that  does  not  appear  on  the 
left  side  of  any  equation  of  the  system. 

The  system  of  causal  equations  is  provided  to  the  solver  in  terms  of  a  very 
simple  language  called  SCE  (Systems  of  Causal  Equations)  that  we  describe  in 
Section  2.  The  equation  solver  then  represents  a  syntax  directed  interpreter  that 
executes  any  correct  SCE  program.  Such  an  interpreter  is  outlined  In  Section  3; 
it  reads  the  elements  of  the  input  sequences  from  an  input  file,  and  calculates 
the  elements  of  the  sequences  on  the  left  side  of  the  equations  specified  by  the 
program. 

it  should  be  noted  that,  by  definition,  data  sequences  have  infinite  length. 
However,  for  practical  reasons,  an  upper  bound  MAXT  Is  given  by  the  user  for  the 
number  of  data  items  of  all  sequences.  Elements  beyond  MAXT  in  any  sequence 
are  considered  to  be  the  don't  care  element  defined  in  [21. 


Although  the  solver  defined  in  this  report  is  quite  general,  its  immediate 
application  Is  to  the  verification  of  computations  on  systolic  networks.  More 
descriptively,  in  order  to  verify  a  computation  on  a  systolic  network,  we  first  write 
an  SCE  program  containing  the  equations  for  the  operation  of  the  network,  and 
create  an  input  file  containing  the  elements  of  the  input  data  sequences.  Then  a 
run  of  the  SCE  interpreter  provides  the  elements  of  the  output  data  sequences. 
The  verification  of  a  systolic  network  by  this  method  is  equivalent  with  the  simula¬ 
tion  of  its  execution.  However  this  approach  to  simulation  seems  to  be  more 
structured  and  separates  the  internal  details  of  the  simulator  from  the  concern  of 
the  user.  It  also  has  the  advantage  of  allowing  the  user  to  solve  partially  the  sys¬ 
tem  of  equations  modeling  the  network  and  then  to  use  the  SCE  interpreter  on 
the  portion  of  the  system  that  could  not  be  solved  analytically.  As  an  example, 
we  show  in  Section  4  how  the  operation  of  a  network  designed  for  the  LU 
decomposition  of  a  symmetric  banded  matrix  is  simulated  by  means  of  the  SCE 
interpreter. 

It  should  be  mentioned  here  that  the  model  in  [2]  and  another  model 
developed  independently  by  Chen  and  Mead  (4.51  are  based  on  the  same  idea  of 
separating  the  formal  specification  of  the  operation  of  the  network  from  the  data 
for  specific  computations.  However,  in  (41  the  operation  of  each  cell  is  defined 
by  a  functional  approach,  while  we  follow  an  algebraic  approach  to  define  rela¬ 
tively  few  sequence  operators  that  are  used  to  model  computational  cells. 


i 


-,T- 


-  4  - 


2.  TIm  SCE  language  for  specifying  Systems  of  Ceusel  Equations 


The  SCE  language  is  a  simple  expression  language  augmented  with  some 
input/output  facilities  and  looping  capabilities  that  provide  for  efficiency  in  the  writ- 
ting  of  programs.  In  its  current  form.  SCE  may  be  used  to  model  the  operation 
of  a  special  class  of  systolic  networks  in  which  the  units  of  information  are  real 
numbers.  However,  by  the  addition  of  new  rules  to  the  grammar,  it  is  possible  to 
model  other  types  of  systolic  networks  at  higher  or  lower  levels  of  architectures. 

By  the  first  rule  in  the  grammar  given  in  Appendix  A.  it  is  readily  seen 
that  an  SCE  program  consists  of  the  following  four  parts:  1)  The  declarations.  2) 
the  input  part.  3)  the  programs  body,  and  4)  the  output  part,  in  the  rest  of  this 
section,  we  will  clarify  the  semantics  of  the  language  assuming  familiarity  with  the 
data  sequence  operators  defined  In  [2]  and  (3). 
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Terminal  symbols  in  the  SCE  language  (see  Appendix  A)  can  be  classified 
Into  four  categories,  namely,  special  symbols  (e.g.  +.  -.  *.  ...).  reserved  words 
(e.g.  FOR,  OUT.  ...).  identifiers  and  constants,  where  a  constant  is  either  a  positive 
integer  or  a  positive  real  number  written  in  floating  point  format. 

In  order  to  ensure  a  clear  distinction  between  Identifiers  and  reserved  words 
in  SCE.  we  have  chosen  ail  the  reserved  words  in  the  language  to  start  with  cap¬ 
ital  letters.  On  the  other  hand,  any  string  of  alphanumeric  characters  starting  with 
a  small  letter  can  be  used  as  an  Identifier,  with  the  understanding  that  only  the 
first  six  characters  are  significant.  Identifiers  in  SCE  should  be  declared  in  the 
declaration  part  of  the  program  to  be  of  one  of  the  following  types: 

1)  Parameter  (rules  3-7):  A  parameter  is  assigned  an  integer  value  at  the  time  of 
its  declaration  and  this  value  is  substituted  textuaily  whenever  the  identifier  appears 
In  the  program. 

23  Index  (rules  8-11):  An  index  in  SCE  is  an  integer  variable  used  In  loop  control 
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and  in  selecting  elements  of  sequence  arrays. 

3)  Sequence  (rules  12-19):  Sequences  are  represented  on  the  machine  by  vectors. 
An  identifier  of  type  "sequence*  may  be  associated  with  either  a  single  sequence 
(rule  16)  or  with  an  array  of  sequences  (rule  15).  For  arrays  of  sequences,  the 
dimension  and  the  tower  and  upper  bounds  are  also  specified  in  the  declaration 

by  enclosing  these  bounds  in  curly  brackets.  For  example,  the  following  SCE 
statement  declares  s  as  an  n  dimensional  sequence  array 

SEQN  ad.  :u.  .  .1  :u  J 

II  n  n 

where  lj  and  u(.  /*1  .•••./»  are  the  lower  and  upper  bounds  for  the  Ith  dimen¬ 
sion.  Bounds  may  be  negative  but  should  of  course  satisfy  the  restriction  that 

uf  *ir  we  also  note  that  there  is  no  limit  on  the  dimensionality  of  an  array. 

After  the  declaration  of  an  array  of  sequences,  its  elements  may  be  identi¬ 
fied  (rules  38-41)  by  using  the  usual  selection  notation  s(p1  .  •  •  •  .  pn>.  where 

each  falls  into  its  corresponding  range,  that  is  //  . 

in  the  context  of  the  abstract  systolic  model,  a  data  sequence  is  associated 
with  a  communication  link  which  is  identified  by  its  color  and  the  label  of  the 

node  at  which  it  terminates.  In  order  to  simplify  the  SCE  specification  of  a  sys¬ 
tolic  network,  we  label  the  nodes  in  the  network  by  n-tuples  of  integers 

(Vj .  •  •  •  ,vn )  for  some  n.  This  enables  us  to  group  In  an  n-dlmensional  array  all 

the  sequences  associated  with  the  links  that  have  the  same  color,  and  naturally 

use  the  color  of  the  links  to  identify  the  array.  With  this,  the  sequence  associ¬ 
ated  with  any  link  y  .  Is  simply  the  element  (v. ,  •  •  •  .v  )  of  the  array  of 

V  I  ittirlf  I  If 

sequences  yO. 

Although  this  usually  leads  to  a  very  clear  SCE  specification  of  systolic  net¬ 
works.  it  Is  sometimes  inefficient  because  some  of  the  elements  in  the  array  may 
not  be  used.  For  example  in  the  LU  network  described  in  section  4.  a  triangular 


array  of  sequences  would  be  more  space  efficient  than  the  rectangular  array 
allowed  by  SCE.  In  such  cases,  more  efficient  storage  could  be  obtained  by 
applying  any  one  of  the  techniques  used  for  storing  triangular  and  sparse 
matrices. 

In  addition  to  arrays  of  sequences,  the  language  allows  the  user  to  declare 
single  sequences.  Three  standard  single  sequences  are  predefined  by  the 
language,  namely  the  don't  care  sequence  d.  the  zero  sequence  o  and  the  unity 
sequence  u.  As  their  names  imply,  the  elements  in  the  d.  o  and  u  sequences 
are  all  O-elements.  zeroes  or  ones,  respectively.  The  user  however  may  re¬ 
ded  are  the  Identifiers  d.  o  or  u  if  he  wishes  to  change  their  definitions. 

The  input  part  of  an  SCE  program  takes  the  form  of  a  single  INPUT  state¬ 
ment  (rules  66-73).  It  serves  two  purposes;  First  it  assigns  an  integer  value  to 
MAXT.  which  specifies  the  number  of  elements  to  be  considered  in  any  sequence, 
and  second,  it  specifies  the  sequences  to  be  read  from  the  input  file.  Nested 
FOR  loops  can  be  used,  to  any  level,  in  specifying  the  input  sequences.  For 
example,  in  the  program  presented  in  Section  4.  the  statement 

INPUT  (MAXT  18  .  FOR  I*  0.3  r(I.V  >: 

specifies  that  MAXT=18  and  that  the  input  sequences  are  r(0,l).  r(l.l).  r(2.1)  and 
r(3.1). 

Similarly,  the  output  part  of  the  program  takes  the  form  of  a  single  state¬ 
ment  (rules  74-78)  that  specifies  the  sequences  to  be  printed  on  the  output  file. 
FOR  loops  are  also  allowed. 

The  body  of  the  SCE  program  is  the  part  that  contains  the  specification  of 
the  system  of  sequence  equations.  It  consists  of  a  list  of  statements,  where  a 
statement  may  be  either  a  sequence  equation  or  a  FOR  loop  that  encloses  a  list 
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of  statements.  Each  equation  has  the  form 

sequence  specification  *  sequence  expression 

where  the  left  side  identifies  a  particular  sequence  and  the  right  side  is  an 
expression  composed  from  sequence  identifiers  and  sequence  operators.  Square 
brackets  may  be  used  in  sequence  expressions  to  override  the  precedence  rules 
defined  by  the  grammar.  Basically,  in  the  evaluation  of  expressions,  the  grammar 
associates  the  highest  priority  with  the  operators  defined  directly  on  sequences. 
Next  in  priority  is  the  scaiar  multiplication  operator  followed  by  the  operators 
and  Finally,  the  operators  '+'  and  are  evaluated  with  the  lowest  priority. 
With  these  precedence  rules  the  sequence  expressions  are  evaluated  from  left  to 
write. 

Although  many  other  sequence  operators  may  be  Incorporated  into  the 
language,  we  only  considered  the  operators  that  were  formally  defined  in  (21  and 
(3].  namely; 

The  positive  shift  operator  nr.  written  in  SCE  as  O(r). 

The  zero  shift  operator  fig.  written  In  SCE  as  Z(r). 

The  spread  operator  Qr.  written  in  SCE  as  T{r}. 

The  expansion  operator  .  written  in  SCE  as  E(r,k), 

r  k  m 

The  accumulator  operator  A  '  .  written  In  SCE  as  A(r.k.s)  and 

The  multiplexing  operator  written  in  SCE  as  M{r;wl .  •  •  •  .wn). 

Frequently,  the  shift,  zero  shift  and  spread  operators  are  used  with  r*l. 
For  this,  the  short  hand  notation  O.  Z.  and  T  may  be  used  instead  of  0(1).  Z(1) 
and  T(1).  respectively. 

The  two  element-wise  operators  U1  and  U2  of  rules  47  and  48  have  the 
same  priority  as  the  operators  *  and  /  but  their  semantics  are  not  defined  by  the 
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language.  As  indicated  in  Section  3.  U1  and  U2  may  be  defined  by  the  user. 

Finally,  we  note  that  rule  55  restricts  the  operands  of  the  accumulator 
operator  to  single  sequences  rather  than  sequence  factors  as  Is  the  case  with  the 
other  operators.  This  restriction  is  not  necessary  and  was  only  imposed  because 
it  leads  to  a  more  efficient  implementation  of  the  SCE  interpreter.  However,  it 
should  be  noted  that  this  does  not  affect  the  expressive  power  of  the  language 
because  we  can  always  define  intermediate  sequences  to  get  around  this  restric¬ 
tion.  For  example,  the  sequence  equation 

xll)  =  0(2)  x(i-V  +  AH.3.U  [  y(l+V  *  7  xf/-lJ  ] 
which  is  not  permitted  in  SCE  can  be  split  Into  the  two  SCE  legal  equations 

v(i)  =  yff+i;  -  T  xd-M 

xii )  *  OC2J  x(l- V  +  AH.3.1;  v(l). 


f? 
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3.  Overview  of  the  SCE  interpreter 


As  is  the  case  with  most  language  interpreters,  the  SCE  solver  has  two 
distinct  phases,  namely,  the  syntax  analysis  phase  which,  using  a  parse  tree  of 
the  program,  produces  an  intermediate  language  program,  and  the  actual  interpre¬ 
tation  phase  which  executes  the  intermediate  program. 

For  the  syntax  analysis  phase,  we  used  the  automatic  parser  generator 
YACC  [6]  existing  on  the  UNIX  operating  system  to  generate  an  LR(l)  bottom-up 
parser  that  accepts  any  syntactically  correct  SCE  program  and  generates  an  inter¬ 
mediate  program  in  the  form  of  a  sequence  of  tuples.  It  is  basically  a  finite 
state  machine  with  a  stack.  It  scans  the  input  program  from  left  to  right  and  is 
capable  of  reading  and  remembering  the  next  input  token  (terminal  symbol)  which 
is  called  the  look-ahead  token.  Depending  on  the  look-ahead  token  and  the  con¬ 
tent  of  the  stack,  the  parser  takes  one  of  the  following  actions; 

1)  Shift:  The  current  look-ahead  token  is  pushed  Into  the  stack  and  the  next  token 
is  read  in.  Also  a  tuple  describing  the  action  is  generated,  if  the  token  being 
shifted  is  a  special  symbol,  an  identifier,  or  a  reserved  word,  the  tuple  generated 
has  the  form  (Shift.n).  where  n  is  a  number  identifying  the  token.  On  the  other 
hand,  if  the  token  is  an  integer  or  a  real  constant,  then  the  tuple  generated  has 
the  form  (Shift-Integer. c)  or  (Shift-real. r).  respectively,  where  c  or  r  is  the  value 
of  the  constant. 

2)  Reduce:  This  action  is  taken  when  the  parser  recognizes  that  the  stack  con¬ 
tains  the  right  hand  side  of  a  grammar  rule,  say  rule  n.  and  that  this  rule  should 
be  applied  at  this  point.  It  then  pops  from  the  stack  the  tokens  forming  the  right 
side  of  rule  n  and  pushes  onto  the  stack  the  token  on  Its  left  side.  It  also  gen¬ 
erates  a  tuple  (Reduce.n). 

3)  Accept  This  action  is  taken  when  the  parsing  process  is  successfully  com¬ 
pleted.  The  tuple  generated  in  this  case  is  (Accept. 0). 
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4)  Error:  If  the  parser  discovers  that  the  program  is  syntactically  incorrect,  it  sim¬ 
ply  gives  a  warning  and  halts.  Of  course,  more  elaborate  error  handling  actions 

could  have  been  taken  if  our  goal  was  to  produce  a  more  sophisticated  parser. 

For  the  details  and  internal  forms  of  LR  parsers,  we  refer  to  (71. 

The  second  phase  of  the  interpreter  reads  the  intermediate  program 

(sequence  of  tuples)  and  reproduces  the  actions  taken  by  the  parser  on  an  action 
stack.  Simultaneously,  an  adjoint  value  stack  is  used  to  store  temporary  values 
needed  in  the  interpretation,  in  Appendix  B.  we  give  the  complete  listing  of  a  C 
program  for  the  second  phase  of  our  SCE  solver,  and  in  the  rest  of  this  section 
we  will  outline  the  main  features  of  this  solver/interpreter. 

The  program  uses  a  location  counter  "location’  to  indicate  the  intermediate 

tuple  being  interpreted.  Starting  with  location  =1.  the  Interpreter  reads  the  tuple 
pointed  to  by  "location",  takes  a  certain  action.  Increases  location  by  one  and 
then  repeats  the  above  cycle.  The  action  taken  in  each  cycle  depends  on  the 
type  of  the  tuple  being  Interpreted,  namely 

1)  If  the  tuple  is  of  the  type  (Shift.n)  or  (Shift-lnteger.n).  then  n  is  pushed  into 
the  action  stack. 

2)  If  the  tuple  is  of  the  type  (Shlft-real.r).  then  r  Is  pushed  Into  the  value  stack 
and  a  zero  Is  pushed  Into  the  action  stack. 

3)  if  the  tuple  is  of  the  type  (Accept.O).  then  the  Interpretation  is  terminated. 

4)  if  the  tuple  is  of  the  type  (Reduce.n).  where  grammar  rule  n  has  the  form 
b  -  a  1  a 2  •  •  •  ak  then  the  interpreter  pops  the  k  top  locations  of  the  stack, 
which  should  contain  the  symbols  a^,  •••  ,ak  and  pushes  b.  It  also  may  exe¬ 
cute  a  semantic  routine  if  any  is  associated  with  this  grammar  rule.  These  rou¬ 
tines  manipulate  the  data  on  the  value  stack  to  reflect  the  semantics  of  the  gram¬ 


mar  rule. 


At  this  point  we  note  that  we  do  not  actually  have  to  push  or  pop  the 
grammar  symbols  in  the  action  stack,  and  that  a  sufficient  action  will  be  to  keep 
track  of  the  top  of  the  stack.  Then  at  the  time  of  a  reduction,  the  grammar  rule 
and  the  top  of  the  stack  determine  uniquely  the  location  that  each  symbol  would 
occupy  on  the  stack.  With  this,  we  can  transmit  information  from  one  semantic 
routine  to  another  by  pushing  this  information  into  the  stack  in  the  place  of  the 
grammar  symbol.  For  example,  the  semantic  routine  associated  with  rule  36  uses 
the  location  of  the  FOR  symbol  on  the  stack  to  store  the  starting  address  of  the 
first  statement  in  the  FOR  loop  body.  Then,  when  the  execution  of  the  FOR  body 
is  terminated,  the  routine  for  rule  35  retrieves  this  address  from  the  stack  to  re¬ 
initiate  the  execution  of  the  FOR  body  if  the  final  value  of  the  index  of  the  loop 
is  not  yet  reached. 

in  order  to  reduce  the  storage  required  for  holding  the  intermediate  tuples, 
the  program  in  appendix  B  reads  and  executes  the  tuples  in  four  stages:  In  the 
first  and  the  second  stage,  the  declaration  and  the  Input  part  of  the  program  are 
processed,  respectively,  in  the  third  stage,  the  system  of  equations  is  solved,  and 
finally  in  the  fourth  stage  the  output  is  printed.  We  briefly  comment  on  each 
stage. 

The  declarations:  The  main  objective  of  this  stage  is  to  construct  the  symbol  table 
and  to  allocate  storage  for  the  declared  sequences.  The  symbol  table  *sym.tabOa 
is  an  array  of  records  with  three  fields.  The  first  field  contains  a  character  that 
indicates  the  type  given  to  the  identifier,  namely  P.  I  or  S  for  parameters,  indices 
or  sequences,  respectively.  The  Interpretation  of  the  integers  in  the  second  and 
third  fields,  called  entryl  and  entry2.  depends  on  the  type  of  the  Identifier.  For 
parameters,  entryl  contains  the  value  of  the  parameter  and  entry2  is  immaterial. 
For  index  variables,  entryl  holds  the  value  of  the  index,  initialized  to  zero,  and 
entry2  is  set  during  the  execution  of  a  FOR  loop  to  the  final  value  of  the  loop 


Finally,  if  the  identifier  is  declared  as  a  sequence  variable,  then  it  may 
denote  a  single  sequence  or  an  array  of  sequences.  Single  sequences  are  dis¬ 
tinguished  by  setting  entry2=-1.  with  entry  1  pointing  to  the  location  where  the 
sequence  is  stored.  For  arrays  of  sequences.  entry2  holds  a  pointer  to  a  bound 

table  that  indicates  the  dimension  and  the  bounds  of  each  array,  and  entryl 

points  to  the  location  where  the  first  sequence  In  the  array  is  stored.  The  first 
three  locations  in  the  symbol  table  are  reserved  for  the  identifiers  d.  o  and  u. 
respectively  that  are  preset  to  the  don't  care,  the  zero  and  the  unity  sequences, 
respectively.  However,  if  any  of  these  Identifiers  are  declared  in  the  program, 

then  the  corresponding  entry  in  the  symbol  table  is  overwritten  by  the  semantics 
routine  corresponding  to  the  new  declaration. 

The  sequences  are  stored  In  a  two  dimensional  array  seq_storeOQ.  Each 

row  in  the  array  has  a  length  at  least  equal  to  MAXT  and  is  used  to  store  the 

elements  of  a  sequence.  Arrays  of  sequences  are  stored  in  consecutive  rows 
such  that  any  Index  changes  slower  than  the  one  to  its  right,  if  any.  In  order  to 

keep  track  of  don't  care  elements,  an  array  d-tableOQ  of  bits  Is  used  such  that 

for  each  element  In  seq_storeQO.  there  is  a  corresponding  bit  in  d-tableOO.  This 
bit  is  set  to  one.  if  the  element  In  seq.storenD  is  a  don't  care,  and  to  zero, 

otherwise.  Consequently,  any  part  of  the  program  that  reads  an  element  from 

seq_storeO(]  should  also  Inspect  the  corresponding  entry  in  d-tableQO. 

The  Implementation  of  the  SCE  solver  listed  in  Appendix  B  allows  for  full 

causality  in  the  sense  that  an  output  may  depend  on  any  previous  input.  Conse¬ 
quently,  storage  is  provided  for  the  retention  of  at  least  MAXT  elements  from  any 
sequence.  A  more  space-efficient  implementation  would  be  to  retain  only  the  last 

C  elements  from  each  sequence  In  a  circular  buffer,  where  C  is  given.  This 
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allows  only  for  C-order  causality  in  the  sense  that  the  output  of  a  certain  cell  at 
any  given  time  t  may  only  depend  on  the  inputs  to  that  cell  during  the  time 
period  from  t-C  to  t-1. 

The  Input  pert  The  INPUT  statement  specifies  the  sequences  to  be  read  from  the 
input  file  as  well  as  the  number  MAXT  of  elements  In  each  sequence.  The  inter¬ 
preter  reads,  as  a  stream,  the  MAXT  elements  of  the  first  specified  sequence  fol¬ 
lowed  by  those  of  the  second  sequence....  etc.  provided  that  the  elements  are 
separated  by  at  least  one  space.  No  special  characters  are  required  to  separate 
the  elements  of  the  different  sequences.  Each  element  in  the  Input  file  may  be 
either  a  floating  point  number  or  the  letter  *d*  representing  a  don't  care  element. 
The  Interpreter  also  recognizes  the  string  •  •  •  •  ■  in  the  input  file  as  an  indication 
that  the  remaining  elements  in  that  sequence  are  don't  cares. 

The  equation  solver  The  sequence  of  tuples  In  the  body  of  the  program  are  exe¬ 
cuted  iteratively  MAXT  times.  A  global  dock  “TIME*  is  Initialized  to  1  and  incre¬ 
mented  at  every  step  of  the  Iteration.  At  every  step,  the  expression  on  me  right 
side  of  each  equation  is  evaluated  at  time  TIME  and  assigned  to  me  correspond¬ 
ing  element  of  me  sequence  on  me  left  side  of  the  equation. 

The  value  stack  is  used  during  the  evaluation  of  sequence  expressions  to 
store  temporary  results.  For  example,  the  semantic  routine  associated  with  me 
grammar  rule  52  <seq_factor  -  seq_spec)  reads  me  value  of  the  element  TIME  in 
the  specified  sequence  from  seq_storeQQ.  and  pushes  it  onto  me  value  stack. 
The  result  of  any  subsequent  sequence  operation  Is  stored  on  the  stack  until  rule 
37  is  executed  and  the  final  result  is  stored  back  into  seq_storeOO. 


The  sequence  operators  0.  Z.  T  and  E  operate  on  sequence  factors  and 
have  the  effect  of  changing  the  global  clock  during  the  evaluation  of  the 
corresponding  factor.  The  old  clock  value  is  stored  in  the  action  stack  for 
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retrieval  after  the  evaluation  of  the  factor  Is  complete  (rule  53).  If  the  result  of  f. 

any  operation  involving  the  above  operators  is  the  don't  c are  element,  then  the 
flag  ‘skip*  is  set  which  causes  the  execution  of  the  semantic  routines  to  be 
skipped  until  the  corresponding  tuple  (Reduce. 53)  is  encountered.  Of  course  pro¬ 
visions  are  made  to  deal  with  arbitrary  degrees  of  nesting. 

In  a  similar  way.  the  flag  ‘Mskip*  is  used  to  chose  the  appropriate  operand 
in  the  multiplexer  operator.  Finally,  we  note  that  by  restricting  the  operands  of 
the  accumulator  operator  to  sequences  instead  of  sequence  factors,  we  simplified 
greatly  the  action  associated  with  that  operator.  For  a  detailed  description  of  the 
different  semantics  routines,  we  refer  to  the  complete  listing  of  the  program  in 
Appendix  B. 


it  is  important  to  note  that  the  SCE  Interpreter  detects  any  Inconsistency  In 
the  given  equations  or  any  attempt  for  solving  equations  which  are  not  causal  or 
weakly  causal.  It  does  so  by  associating  with  each  sequence  an  entry  In  the 
array  Iast_computed0  to  keep  track  of  the  last  element  that  has  been  computed  in 
the  sequence  so  far.  Any  attempt  to  overwrite  an  already  calculated  element  or  ^ 

to  read  an  element  that  has  not  yet  been  calculated  is  then  easily  detected  and 
reported  as  a  run  time  error.  The  interpreter  also  detects  other  types  of  run-time 
errors  that  are  listed  In  the  function  run.error  In  the  Appendix. 


The  output  part  After  completing  the  Interpretation  of  the  body  of  the  program, 
the  sequences  specified  by  the  user  are  printed  on  the  standard  output  file. 

Finally,  we  note  that  a  possible  optimization  of  the  implementation  of  the 
interpreter  could  be  achieved  by  replacing  the  single  value  stack  by  k  stacks,  for 
some  optimal  k.  This  would  reduce  the  total  number  of  Iterations  through  the 
body  of  the  program  by  considering  at  each  step  the  elements  TIME.  TlME+1.  ••• 

.  TiME+k  of  the  sequences  instead  of  only  one  element  at  a  time.  However,  if 


rn  'a.-v!  frv 


the  system  contains  any  recursion,  then  only  few  of  these  k  elements  (and  in 
many  cases  only  one)  can  be  considered  at  each  step,  and  this  requires  more 
complicated  book  keeping  to  update  the  array  last_computed  and  the  global  clock. 
We  decided  not  to  implement  this  optimization  because  we  intended  the  solver  to 
be  used  in  cases  where  analytical  solutions  of  the  system  of  equations  are  not 
possible,  and  hence  where  recursivlty  is  usually  present. 
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4.  Example:  An  LU  factorization  network 

In  this  section  the  SCE  interpreter  is  applied  to  the  simulation  of  the  com¬ 
putations  of  a  network  for  the  LU  or  the  U7 DU  factorization  of  a  symmetric 
banded  matrix  A  and  the  solution  of  the  linear  system  of  equations  Ax«y  with  a 
given  vector  y.  It  will  be  shown  also  that  with  slight  modifications,  the  same 
network  can  be  used  to  compute  the  Cholesky  decomposition  uJ  of  the  matrix  A. 

The  first  systolic  network  for  factoring  a  banded  matrix  into  the  product  of  a 
lower  triangular  matrix  and  an  upper  triangular  matrix  was  suggested  by  Kung  and 
Leiserson  [8].  Later.  Brent  and  Luk  {91  modified  the  Kung  and  Leiserson  network 
to  compute  the  Cholesky  decomposition  of  symmetric  matrices.  The  network 

described  in  this  section  is  also  designed  for  symmetric  matrices  but  Is  different 

in  its  operation  principle  from  the  one  In  [9].  Both  networks  use  almost  the  seme 

number  of  computational  cells  and  achieve  approximately  the  same  speed-up  over 
serial  execution.  They  differ  however  in  the  type  of  computational  ceils  and  in 

their  interconnections. 

Consider  the  system  of  linear  equations 

A  x  »  y  (4.1) 

where  A  Is  an  nxn  matrix  and  x  and  y  are  n  dimensional  vectors.  The  solution 
x  of  (4.1)  may  be  obtained  by  finding  a  lower  triangular  matrix  L  and  an  upper 
triangular  matrix  U  such  that  A  *  L  U.  and  then  solving  the  two  triangular  sys¬ 
tems  L  z  *  y  and  U  x  *  z.  More  specifically,  assuming  that  A  is  symmetric 
and  banded  with  band  width  2k+1.  and  denoting  the  elements  of  the  matrices  A.  L 
and  U  by  ay  ^  it  j  and  uf  ^  respectively,  we  can  use  the  following  algorithm  to 
compute  the  LU  decomposition  of  A.  Note  that  only  the  elements  a/  j  with  f>i 
are  used  and  that  only  the  non  zero  elements  of  L  and  U  are  computed. 
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ALOI  :  LU  factorization. 

FOR  l»l.  •  •  •  .n  DO 


1)  FOR  j*l.  •  •  •  .  min{n.  l+k}  DO 
11)  '/./  *  •/./ 


1 2>  «/  /  -  ~ 

*1.1 

23  FOR  q«l+1.  •  •  •  .mlntn.  KHd  DO 
FOR  j«q.  •  •  •  .min(n.  l+k)  DO 

V I  *  V/  ‘  "/./ 

At  this  point  wa  nota  that  tha  matrix  L  obtainad  by  tha  above  algorithm 
satisfies  L*UT0.  where  D  is  tha  diagonal  matrix  defined  by  df  t*tf  r  Also,  by 
replacing  steps  1.1  and  1.2  In  ALQ1  by 


V/  •  “u  ■ 


we  obtain  an  algorithm  for  the  Cholesky  decomposition  of  A  into  LL 


After  having  performed  the  LU  decomposition  of  A.  we  may  compute  the 
vector  z*L-1y  by  the  following  algorithm 


ALQ2  :  Back  substitution. 

FOR  1*1.  •••  .n  DO 


FOR  q-l+1.  •  •  •  .min<n,  i-HO  DO 

'9  *  U  -  '»./  */ 


Finally,  the  solution  of  Ux»z  may  be  obtained  by  an  algorithm  similar  to 
ALQ2.  or  alternatively  by  using  ALQ2  itself  for  the  solution  of  U  7  »  7.  where 
7  ■  z  7.  *  xm  ...  and  U  is  a  banded  lower  triangular  matrix  given  by 

/  n  •#  t  l  /  n  •#  t  I 


18 


UIJ  *  un-/t1./i-/+l* 


As  an  example,  let  n*5  .  k»2  and 
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-2  and  y  « 

15 

0 

-3 

2 

-19 

1 

0 
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0 

-2 

1 

14J 

-6 

By  ALQ1  we  obtain 

'2  0  0  0  O' 

4  3  0  0  0 

L  *  6  3-1  0  0 

0-3530 
0  0-2-9  -9 

and  by  ALG2 


2 

-41 

2 

-19 

3 

and  x  * 

27 

-3 

6 

3 

L  3J 

The  graph  of  the  systolic  network  that  executes  ALQ1  and  ALQ2  simultane¬ 
ously  is  shown  in  figure  4.1.  It  is  composed  of  interior  nodes.  Each 

node  is  labeled  by  a  pair  (J.J).  where  I  and  j  are  the  coordinates  of  the  node 
with  respect  to  the  two  axes  shown  in  the  figure.  The  color  of  each  edge  Is 
determined  by  Its  direction,  namely  edges  directed  to  the  east,  south  and  south 
west  are  given  the  colors  s.  b  and  c.  respectively,  and  those  directed  north  are 
assigned  the  colors  r  or  p  depending  on  their  relative  position. 

The  part  of  the  graph  that  is  formed  by  nodes  (i.p.  /*!.••  *.Atl. 
/»!.••  ♦.*-/  +2  represents  a  subnetwork  that  executes  ALG1.  it  consists  of  three 
types  of  nodes  whose  operation  is  described  by  the  following  causal  equations: 
(note  that  the  sequence  associated  with  a  link  y/  t  is  denoted  by  q, 
the  greek  letter  corresponding  to  the  color  of  the  link  y). 


fl 

2 

3 

0 

0 

0 

1 

1 

-1 

0 

and  U  s 

0 

0 

1 

-5 

2 

0 

0 

0 

1 

-3 

0 

0 

0 

0 

1 

where  1?  is 


t  f 

1 —  C,  ,*•> 


For  node  <k+1.1) 

ak.l  *  n0  Ir  7  Ip*+l.l  "  7/f+l.in 

where  r  is  the  unity  sequence  defined  by  rtf)*1.0  for  any  t. 
For  nodes  a.ll.  t»l.  •••  .k 

»/. 2  *  n0  "><.1  -  Vl1 

”/.!  *  n0  lal.  1  *  <»l.  1  *  T/.l"  *  n0  a/.l  '  0|.2 


al- 1.1 

X 

no 

°i.  i 

For  nodes  a 

l.p. 

1*2 

•  •  • 

• 

*/-!./ 

X 

no 

°/./ 

ff/./+ 1 

X 

no 

"u 

p/./  + 1 

X 

no 

pu 

-1 

■  n 

o 

(4.6  .a) 
(4.6.  b) 
(4.6.0 


(4.7.a) 
(4.7. b) 
(4.7.0 
(4.7.d) 


On  the  other  hand,  the  part  of  the  graph  composed  from  the  nodes  (O.j). 


It +1  corresponds  to  a  subnetwork  that  executes  ALG2.  The  operations  of 
the  ceils  in  this  subnetwork  are  described  by 


For  node  <0.1) 


P0.2  *  n0  ^0.1  “  Vi1 

*0.0  *  n0  1<T0.1  *  Ip0.1  ~  *0.l”  *  n0  °0.1  '  p0.2 

For  nodes  (O.p.  J»2.  •  *  •  .k+1 


'0./  +  1 


no  P0 ./ 


*0.1- 1  *  no  wo ./  +  °o./  '  poj> 


(4.8. a) 


(4.8.0) 


(4. 9. a) 


(4.9.0) 


Note  tnat  the  nodes  (/.I).  /*0. •  •  •  .k  correspond  to  subtract/multiply  cells, 
while  the  nodes  (/./).  /*2.*  ♦♦.*+!.  /«1  .♦••.*+2-/.  ore  multlpiy/add  cells.  Only 
the  node  (k+1.1)  is  a  subtract/divide  cell,  in  other  words,  the  network  is  com¬ 
posed  of  three  basic  types  of  simple  computational  cells. 


For  proper  operation  of  the  network,  the  input  sequences  r1  y.  /*l.«**.*tl 
and  Iqai-1  are  set  t0  the  zer0  sequence  i  defined  by  i(f)*0.0  for  ail  t.  and  the 
input  links  a*+2_y  y.  /*!.  •••.*♦!.  are  connected  to  the  links  P*+2_/ /  (see  f,0_ 


ure).  that  is 


/*!.•••  ./ctl 


(4.10.a) 


O.ktl  ‘ 

ak  +2-1 .1  *  nk+2-l.l 


/*  2.  •  •  •  ,k +1 


(4.10.0) 


(4.10.0 


The  elements  of  the  matrix  A  and  the  vector  y  are  fed  into  the  network 
through  the  links  rf  1 .  /*i.*  •  *./r+i  and  rQ  1 .  respectively.  The  precise  input 
specification  is  given  by 


„  _k+l-/  .2 

,i  * n  9  •< 

Jttl  _2  ’ 

p0.1  *  no  9  * 

where  T(ay)«/?-(*  + 1-/).  f(7))*n  and 


/*!.•••  ,k+l 


(4.11.a) 


(4.11.0) 


a/tt)  *  Vmti-/ 

i»tf)  »  yf 

that  it.  contains  the  n-q  elements  of  the  qth  off  diagonal  of  A.  and  n 

the  n  elements  of  the  right  hand  side  vector  y. 

in  order  to  understand  the  principle  of  operation  of  the  network,  we  first 
note  that  at  iteration  step  l  of  algorithm  ALG1.  the  Ith  row  of  U  and  the  Ith 

coiomn  of  L  are  computed  from  the  Ith  row  of  A  (steps  1.1  and  1.2).  However, 
the  elements  of  the  matrix  A  are  continuously  modified.  In  particular,  at  the  exe¬ 
cution  of  step  l.  the  elements  in  row  i  of  A  had  been  modified  by  subtracting 
from  them  different  contributions  (step  2)  during  the  steps  /-*.•  •  *./-l.  In  the 

systolic  network  of  figure  4.1.  the  elements  of  the  unmodified  ith  row  of  A  arrive 
at  the  cells  (q.l).  q*1.*  •♦.**!.  on  the  r  colored  links.  At  the  same  time,  the 
sum  of  the  contributions  from  the  previous  Iterations  /-*.•••./- 1  arrive  at  the 

same  cells  on  the  c  colored  links.  The  subtraction  is  then  performed  and  the 
elements  of  the  corresponding  coiomn  and  row  of  L  and  U  are  computed  and 

sent  out  on  the  r  and  p  colored  links,  respectively.  These  elements  propagate 
upward  in  the  network  allowing  the  cells  (q./>.  /  »2.  •  •  •  .k +1 .  q*1,*  •  *.k+2-/  to 
compute  and  sum  the  contributions  for  the  modification  of  the  subsequent  rows  of 
A.  These  contributions  are  sent  downward  on  the  c  colored  links.  Finally,  the 
subnetwork  formed  by  the  ceils  (0./).  /»1.--*.A+1  operate  in  a  similar  way. 

A  careful  study  of  the  behavior  of  the  network  shows  that  the  significant 

elements  of  the  matrix  U  are  sampled  from  the  links  Pq  y  q=l.*  ••.ft.  and  the 

elements  of  the  partial  solution  vector  z  from  bQ  Q .  These  results  are  sufficient 

for  the  computation  of  the  solution  x*U~^z.  However,  the  elements  of  the  diago¬ 
nal  matrix  0.  where  L*UTD.  are  also  available  on  the  link  sk  ^ .  More  precisely, 
.ne  output  sequences  are  expected  to  have  the  following  description; 
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iTy  1  *  n*+2  q  02  tig  g*1.*«*.*  (4. 12. a) 

*♦2  2 

iJ0  0  •  n  6*  c  (4.12.M 

On  »  fl  02  H  (4.12.0 

where  r(A^)*/»-(*+l-q).  T(0*T(X)*n  and 

V*  *  Vrt**i-q 

Xtt)  -  dt  t 

C(y)  •  zt 

Aft ar  tha  computation  of  U  and  z  terminates,  we  may  use  the  network  for  a 

second  time  to  solve  £77*7  and  obtain  the  vector  x.  Of  course  only  few  cells  will 

be  doing  useful  work  during  this  second  run.  At  this  point  we  note  that  we  can 
add  to  the  network  any  number  of  columns  of  ceils  identical  to  the  column  (0./). 
/*1.«  •  This  enables  us  to  use  the  network  to  solve  (4.1)  for  more  than 

one  right  hand  side  vector  y  simultaneously. 


Finally,  we  note  that  the  network  described  in  this  section  can  be  modified 

r  r 

to  perform  the  Cholesky  decomposition  LL  instead  of  the  U  OU  decomposition. 
For  this,  equation  (4.5)  that  describes  the  operation  of  node  (k+1.1)  in  the  network 
have  to  be  replaced  by 


°k.  l  x  n0  * 

and  the  data  on  the  links  p/ 2>  /■!.••  •.*  have  to  be  set  equal  to  the  data  on 
ni  2‘  /*1'**'-*-  This  ha*  effect  of  modifying  (4.7.d)  to 

*/+l./-l  *  n0  lyl.l  *  *l.i  ai.f 

It  Is  clear  that  In  this  case,  the  links  rf  ^  / *2. /*!.••  •,*+2-/  carry  redun¬ 


dant  Information  and  hence  can  be  removed  from  the  network. 


After  this  description  of  the  network,  we  turn  our  attention  to  the  task  of 
simulating  its  operation.  First,  we  write  the  SCE  program  that  describes  the  net¬ 


i 


work  and  contains  the  equations  that  model  its  nodes.  In  the  following  program, 
the  parameter  k  which  determines  the  size  of  the  network  is  set  to  2. 

The  SCE  program  for  the  network  of  figure  4.1 


PAR  k=2  ; 

INDEX  i.j  : 

SEQN  s(0:k.1:k+l)  . 
r(0:k+1,1:k+2}  . 
pC1:k.1:k+2)  . 
c(l:k+1.1:k+1)  . 
b{0:0.0:k+1}  : 


m 


* 

INPUT(  MAXT  18.  For  l=0.k+1  rU.1)  >  ; 

/* 

input  statement 

V 

H 

c 

m 

FOR  j=1.k+l  DO  cfl.j)  =  o  END  ; 

/* 

equation 

<4.10.a) 

V 

btO.k+l)  =  o  ; 

/* 

equation 

(4.10.P) 

V 

•\v 

•  ■  * 

sfk.  1}  *  Z  [  u  /  tr(k+1.1)  -  cfk+1.1)]  ]  ; 

/* 

equation 

(4.5)  */ 

1 

FOR  1=1. k  DO 

m 

r{i.2J  =  Z  I  r(l.l)  -  cCI.1}  J  ; 

/* 

equation 

(4. 6.  a) 

V 

.  •*. 
v_ 

p(i.2)  =  Z  s(l.l)  *  r(i.2)  ; 

/• 

equation 

(4.6. b) 

V 

' 

sCi-1.1)=  Z  3(1.1} 

/« 

equation 

(4.6.0 

•/ 

ENO  ; 

•  ^ 

■ 

FOR  j=2.k+l  DO 

m 

FOR  l=1.k+2-j  DO 

iiS'j 

s(l-l.j)  =  Z  s(i.j)  ; 

/* 

equation 

(4.7.a) 

V 

;V- 

V 

p(I.J+1}  =  Z  p(I.J)  ; 

/* 

equation 

(4.7.0 

V 

rd.j+1)  =  Z  rtl.j}  : 

/* 

equation 

(4.7.C) 

*/ 

.  ^  t 

Cd+I.j-D  =  Z  I  c(l.j)  +  rd.J)  *  sd.j)  ] 

/* 

equation 

(4.7. d) 

V 

. 

END  ; 

w 

s(k+2-j.j)  =  p(k+2-J.j} 

/• 

equation 

(4.10.0 

V 

.•-7t 

END  ; 

r(0.2)  =  Z  (  rCO.  1}  -  b(0.1}  }  ; 

/* 

equation 

(4.8. a) 

V 

►  •  »  i 

. 

r 

b(0.0}  =  r(0.2)  *  Z  s(O.l) 

/• 

equation 

(4.8.0 

V 

r-“ 

% 

FOR  )=2.k+1  DO 

1  f* 

r(0.j+1)  =  Z  r(O.J)  ; 

/* 

equation 

(4.9.a) 

V 

b(O.j-l)  =  Z(2)  (  b(0,j}  +  s(O.j)  *  r(O.j)  J 

/• 

equation 

(4.9.0 

V 

» • . 

END  ; 

L 

OUT(  b(0.0)  .  FOR  i=1.k  pd.2)  .  s(k.l)  )  ; 

/• 

output  statement 

V 

\*.v 

L 


•  j 
-J 
*-■  "J 


2 


Next,  we  will  use  the  above  program  to  simulate  the  computation  of  the 
matrices  L.  U  and  the  vector  z  for  the  matrix  A  given  in  (4.2).  In  order  to 
specify  the  input  for  this  computation,  we  note  that  The  INPUT  statement  in  the 
above  program  limits  the  length  of  the  sequences  to  18  elements,  it  also  deter¬ 
mines  the  order  in  which  the  input  sequences  are  read  from  the  input  file, 

namely  pQ  y  p1  r  p2  1  and  p3  y  respectively.  Accordingly,  we  follow  the  pattern 
specified  by  (4.11. a/b).  and  use  the  data  from  (4.1).  to  construct  the  following 
input  file 

Input  file: 

0.0  0.0  0.0  4.0  d  d  14.0  d  d  15.0  d  d  0.0  d  d  -6.0  ... 

d  d  6.0  d  d  -3.0  d  d  -2.0  d  d 

d  4.0  d  d  15.0  d  d  2.0  d  d  1.0 

2.0  d  d  11.0  d  dT  20.0  d  d  -19.0  d  d  14.0  ... 

Finally,  we  use  the  SCE  interpreter  to  run  the  above  program  with  the  given 
input,  this  produces  the  following  output  file 

*«*«  OUTPUT  SEQUENCES  **** 

0.00  0.00  0.00  0.00  2.00  d  d  2.00  d  d  3.00  d  d  -3.00  d 

d  3.00  d 

0.00  d  d  3.00  d  d  -1.00  d  d  2.00  d  d  d  d  d 
d  d  d 


0.00  d  2.00  d  d  1.00  d  d  -5.00  d  d  -3.00  d  d  d 
d  d  d 


0.00  0.50  d  d  0.33  d  d  -1.00  d  d  0.33  d  d  -0.11  d 
d  d  d 

where  as  specified  by  the  OUT  statement,  the  sequences  are  printed  in  the  order 
t>00*  ^i2-  nz  2  and  °k  1’  respectively.  It  Is  easy  to  verify  that  this  output 
agrees  with  the  results  In  (4.3)  and  (4.4).  and  the  formulas  (4.12.a/b/c). 


-  25 


5.  Conclusion 

In  this  paper,  we  presented  an  equation  solver  to  supplement  the  formal 
systolic  model  discussed  in  [21.  This  solver  is  intended  to  be  used  in  particular 
for  the  verification  of  systolic  networks  in  the  cases  whsre  the  resulting  system  of 
equations  modeling  the  network  cannot  be  solved  analytically. 

The  application  of  this  solver  to  the  verification  of  a  computation  of  a  sys¬ 
tolic  network  is  equivalent  with  the  simulation  of  its  operation  for  specific  input 
data.  More  specifically,  the  user  specifies  the  architecture  of  the  network  to  the 
solver/simulator  in  the  form  of  an  SCE  program  that  consists  in  essence  of  the 
system  of  equations  modeling  the  network  in  a  computer  readable  form.  This 
program  is  then  interpreted  by  the  solver  and  the  output  data  are  computed  for 
specific  inputs. 

It  should  be  clear  that  the  soiver/simulator  can  only  be  used  to  verify 
specific  instances  of  computations  on  networks;  that  is  the  input  data  have  to  be 
given  specific  values  during  the  simulation.  This  limitation  applies  to  any  simulator 
or  numeric  solver,  and  the  only  way  to  allow  for  a  more  general  simulation  would 
be  to  consider  symbolic  manipulation  and  to  obtain  a  symbolic  description  of  the 
outputs  in  terms  of  the  inputs. 

The  potential  application  of  the  SCE  language  presented  in  this  paper  is  not 
limited  to  the  solver/simuiator.  Namely,  the  SCE  language  may  be  used  to  specify 
precisely  any  systole  network  that  can  be  described  in  terms  of  the  abstract 
model,  in  fact,  for  a  given  network,  one  may  write  an  SCE  program  in  which  the 
causal  equations  and  the  sequence  declarations  describe  completely  the  graph  of 
the  network  as  well  as  the  operation  of  each  of  Its  cells.  This  SCE  specification 
may  be  used,  for  example,  as  an  Input  to  an  automatic  lay-out  program  or  to  a 
translator  that  generates  specifications  in  other  languages  used  in  computer  aided 
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design  of  VLSI  devices. 


The  syntax  directed  approach  used  in  the  implementation  of  the 

solver/simulator  led  to  a  very  modular  program.  This  simplifies  the  task  of  modi¬ 
fying  the  solver  to  incorporate  new  SCE  grammar  rules  that  need  to  be  added 

when  new  types  of  sequence  operators  are  Introduced.  Actually,  the  addition  of  a 

new  sequence  operator  to  the  grammar  requires  only  the  implementation  of  a 
corresponding  semantics  routine  that  describes  the  effect  of  the  operator. 

Finally,  we  note  that  the  solver  was  extensively  tested  in  the  solution  of 
many  systems  of  equations,  including  those  resulting  from  the  networks  verified 
analytically  in  Meihem  Rheinboidt  and  .  Meihem  Stiffness  in  this  paper,  we 
applied  the  solver  only  to  the  verification  of  a  network  which  we  designed  for  the 
LU  factorization  of  symmetric  banded  matrices  and  the  solution  of  the  correspond¬ 
ing  system  of  linear  equations.  This  example  shows  the  simplicity  and  effective¬ 
ness  of  our  approach  for  the  simulation  of  systolic  networks. 
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Appendix  A:  The  grammar  for  the  3CE  language. 


Terminal  symbols 


PAR 

INOEX 

SEQN 

FOR 

DO 

END 

MAXTIME 

INPUT  OUT 

Comma 

Semi 

Colon 

Equal 

Plus 

Minus 

Muit 

Div 

Lbrak 

Abrak 

Lcurl 

Rcurl 

Lpar 

Rpar 

Period 

0 

Z 

T 

A 

E 

M 

U1 

U2 

Identifier 

Positivejnteger 

Positive.real 

Grammar  rules 


1)  <prog>  <declare>  <ln_part>  <body>  <out_part> 

2)  <deciare>  <par_decl>  <index_deci>  <seqn_deci> 

/•  PARAMETER  DECLARATIONS  ■/ 

3)  <par_decl>  PAR  <par_llst>  Semi 

4)  I 

5)  <par_list>  <par_llst>  Comma  <par_stmt> 

6)  l  <par_stmt> 

7)  <par_stmt>  identifier  Equal  Positive.integer 


/*  INOEX  DECLARATIONS  "/ 

8)  <index_decl>  INDEX  <l_liat>  Semi 

9)  I 

10)  <l_llst>  <1 llet>  Comma  identifier 

11)  l  identifier 


/*  SEQUENCE  DECLARATIONS  «/ 

12)  <seqn_decl>  SEQN  <dim_iist>  Semi 

13)  <dim_list>  <dim_list>  Comma  <seqn_dlm> 

14)  I  <seqn_dim> 

15)  <seqn_dlm>  Identifier  Lcurl  <range_iist>  Rcurl 

16)  l  Identifier 

17)  <range_llst>  <range_list>  Comma  <range> 

18)  I  <range> 

19)  <range>  <i_expr>  Colon  <l_expr> 


/“  INDEX  EXPRESSIONS  ■/ 


20) 

21) 

22) 

23) 

24) 

25) 
25) 

27) 

28) 
29) 


<l_expr> 


<l_term> 

<l_factor> 


<simple_factor> 


<l_expr>  Plus  <l_term> 
<l_expr>  Minus  <l_term> 
<l_term> 

<l_term>  Mult  <i_factor> 
<l_factor> 

<simplo_factor> 

Minus  <simple_factor> 
Lpar  <l_expr>  Rpar 
identifier 
Positlvejnteger 


/*  THE  BODY  OF  THE  PROGRAM  «/ 


30)  <bOdy> 

31)  <stmt_list> 

32) 

33)  <stmt> 

34) 

35)  <for_stmt> 

36)  <for_spec> 

37)  <eqn> 


<stmt_list>  Semi 
<stmt_llst>  Semi  <stmt> 

I  <stmt> 

<eqn> 

I  <for_stmt> 

FOR  <for_spec>  <stmt_llst>  END 

Identifier  Equal  <i_expr>  Comma  <l_expr>  DO 

<seq_speo  Equal  <seq_expr> 


/*  SEQUENCE  SPECIFICATIONS  •/ 


38)  <seq_speo 

39) 

40)  <indicat_list> 

41) 


Identifier  Lcurl  <lndicat_Ust>  Rcurl 
l  identifier 

<lndlcatjist>  Comma  <i_expr> 

I  <l_expr> 


/*  ELEMENT  WISE  OPERATORS  ON  SEQUENCES  V 


42)  <seq_expr> 

43) 

44) 

45)  <seq_term> 

46) 

47) 

48) 

49) 

50)  <s_factor> 

51) 


<seq_expr>  Plus  <seq_term> 

I  <seq_expr>  Minus  <seq.term> 

I  <seq_term> 

<seq_term>  Mult  <s_factor> 

I  <seq_term>  Div  <s_factor> 

l  <seq_term>  ui  <s_factor> 

I  <seq_term>  U2  <s_factor> 

I  <s_fector> 

Posltive.reai  Period  <seq_factor> 
l  <seq_factor> 


/*  OPERATORS  DEFINED  DIRECTLY  ON  SEQUENCES  "/ 


52) 

<seq_factor> 

<seq_spac> 

S3) 

I 

<simple_op>  <aeq_factor> 

54) 

1 

<multipiex_op>  Lpar  <cholce_list> 

Rpar 

55) 

1 

A  Lcurl  <i_expr>  Comma  <i_expr> 

Comma 

<Lexpr>  Rcuri  <seq_spec> 

56) 

1 

Lprak  <seq_expr>  Rbrak 

57) 

<multipiex_op> 

M  Lcurl  <l_expr>  Semi  <ratlo_llst> 

Rcuri 

58) 

<simple_op> 

0  <op_power> 

59) 

1 

Z  <op_power> 

60) 

I 

l 

T  <op_power> 

61) 

I 

E  Lcurl  <l_expr>  Comma  <l_expr> 

Rcuri 

62) 

<op_power> 

Lcurl  <l_expr>  Rcuri 

63) 

1 

64) 

<cholce_iist> 

<choice_llst>  Comma  <seq_axpr> 

65) 

1 

<seq_expr> 

66) 

<ratlo_llst> 

<ratlo_llst>  Comma  <l_expr> 

67) 

1 

<i_expr> 

/*  INPUT 

SPECIFICATIONS  */ 

68) 

<ln_part> 

INPUT  Lpar  <lnp.Jlst>  Rpar  Semi 

69) 

<lnp_l}st> 

<lnp_liat>  Comma  Onp_spec> 

70) 

1 

MAXT  Positivejnteger 

71) 

<lnp_speo 

•  • 

<seq_speo 

72) 

1 

FOR  <lo_for>  <inp_spec> 

73) 

<lo_for> 

Identifier  Equal  <l_expr>  Comma 

<l_expr> 

/*  OUTPUT  SPECIFICATIONS  V 

74) 

<out_part> 

OUT  Lpar  <out_llst>  Rpar  Semi 

75) 

<outJlat> 

<out_Hat>  Comma  <ouL.apec> 

76) 

1 

<out_spec> 

77) 

<out_spoc> 

<seq_speo 

78) 

1 

FOR  «lo_for>  <out_spec> 

AfMfldix  B:  The  listing  of  the  SCE  interpreter  program. 


/%%  GLOBAL  DECLARATIONS  **/ 
•include  <stdio.h; 


•define  Prog-length 
•define  N.rules 
•define  N.'jufts 
♦define  N.syabol 
•define  N.bound 
•define  N.seon 
•define  Maxtiae 
•define  stack.ienath 
•define  N-words 
•define  N_ ratios 
•define  N.real 


630 

78 

ft 

nuaber  of  rules  in  the  granger 

tf 

24 

ft 

rules  not  reouirins  any  action 

1/ 

20 

ft 

size  of  the  syabol  table 

*/ 

20 

ft 

size  -of  the  bound  table 

tf 

129 

ft 

aax.  nuaber  of  seouences  used 

tf 

30 

ft 

ueeer  liait  on  siaulation  tine 

tf 

40 

ft 

length  of  working  stack 

tf 

< (Maxtiae  M)  /  14  )  M 

5  /*  aax.  •  of  arguaents  in  M  operators 

5  /%  tax.  •  of  real  constants  used 


*/ 

«/ 


int  overflowC33  3  <  ?rog_length>  stack.lensth’  N.sygbol’  N_bound» 

N-seoni  Maxtiae  +  1  »  N_ratios»  N-real  >  » 


FILE  *foe«n<)  »  *fp  I 

/t  an  array  to  store  the  erosraa  triples  t/ 
struct  <  char  action  5 

int  value  l 

int  toe  i  >  erosCPros_l*nSth3  5 

int  location  f  /%  pointer  to  pros  array  %/ 


ft  aduuslti]  contains  the  length  of  the  R.H.S.  of  sraeear  rule 
i  sinus  or*’  which  is  the  adJustsent  in  the  toe  of  the  stack  XI 
int  adJustCN-rulesl  3  <-3»  -2»  -2»  1*  -2»  0»  -2 t  ~2t  1»  -2 » 

0»  -2»  *2»  0»  -3»  0»  -2 ’  O’  -2 r  -2 » 

-2»  0»  -2»  0»  0»  -1»  -2»  0»  0»  -1’ 

-2 »  0»  0»  0»  -3’  -5»  “2’  -3»  0»  “2’ 

0*  -2>  -2»  0»  ~2»  -2»  -2»  -2>  0»  -2’ 

0*  0»  **i»  -3»  -8»  -2»  *5’ 

-5.  -2.  1» 


•l»  -1 t  -1» 
0»  -2 »  0’  -4»  -2»  -1» 


>» 


0»  -2»  -4»  -4»  -2»  0»  0»  -2 


ft  dussC3  contains 
any  action  tf 
int  dunsCN-duas3 


the  nuaber  of  the  sraaaar  rules  not  reouirins 

3  <  3»  4’  S’  6»  8i  9»  12’  13’  14’  22’ 

.  24»  23’  29’  31’  32’  33»  34»  44»  49’  31’ 

69’  74»  73’  76 
>J 


int  stackCstack_lensth3  5 
float  fstackCstack-lensthl  » 

ft  SYMBOL  TABLE  •  tyee*  Si  P  or  I  for  seon. 
struct  <  char  tyee  f 
int  entryll 

int  entry2>  >  sya_tsbCN_syabol3  } 

int  lboundCN_boun>13  i 
int  uboundCN_bound3  i 
int  ran_etr  3  -1  I 


/*  SEQUENCE  STORAGE  tf 
rioat  seo_storeCN_seon3CMaxtiae+13  I 

unsigned  d_tableCN.seon3CN-words3  I 

int  seo_etr  3  0  I 
int  sea.size  I 

ini  last.coaeutedCN.seonl  I 

float  r.storeCN_real3  I 

int  r.etr  3  -1  > 


int  ratio.teaeCN.ratiosl  I 
int  ratio-etr  I 


/*  dvnaaic  working  stack  t/ 
ft  a  Matching  value-stack  %/ 

earaaeter  or  index’  reseectively.  f/ 


/%  lower  bound  table  */ 

/%  ueeer  bound  table  %/ 

/%  eointer  to  bound  table  %/ 


It  seouences  storage  %/ 
l X  keees  tracks  of  don't  cares*/ 
/*  eointer  to  seeuence  store  */ 

/*  actual  si<:e  of  the  table.  */ 

/%  for  consistency  checks  */ 

/%  storage  for  real  constants  %/ 
ft  the  correseonding  eointer  */ 

ft  to  store  eultielexer  ratios  %/ 
ft  a  eointer  to  ratio.teae  */ 


I 


int  not. done 


ft  a  looe  control  variable  tf 


32 


/*  THE  MAIN  PROGRAM  </ 

MiftO 


char  action  * 
int  value  * 

int  to*  *  -1  I  /%  currant  to*  of  stack  %/ 

int  J*  lookins  * 
int  staaa  ) 

int  and.staaeCSl  * 

end_staaeC13«2  f  end_staaeC23»A8  * 
and_stasaC33*30  $  tnd_staaeC43»l  * 

ft  o*an  the  fil*  that  contains  tha  rroaraa  tu*las  t / 
f*  *  fo*en(‘out.*arse**  *r*)  > 
for(saa.*tr«0  >  *eo_*tr  <  N.seen  >  see.*tr++) 
for<j»0  )  J  <  N.words  I  J++) 
d_tableCseo.*tr3CJ3  *  0  I 
seo-*tr  -«  0  5 

it  Build  standard  entries  in  syebol  tablet  they  can  be 
overwritten  by  *ro*er  declaration  %/ 
ror(j«0  I  j  <  3  I  j H) 

<  sy»_tabCJ3.ty*e  ■  'S'  I 
svflutabCJI.entrvl  »  seo.*tr  * 
last_coe*utedCsea_*tr++3  a  Haxtiae  * 

*y»_tabCJ3.entry2  *  -1  *  /*  indicating  a  sinale  seouence  */ 

> 

for(j»3  )  j  <  N.syebols  J  J++) 
sve_tabCJ3.ty*e  •  '  '  I 
for(J«l  t  j  <  haxtiae  *  j++) 

<  see.storeCHCJ]  «  0.0  ) 
aea_storeC23CJ3  >1,0  *  > 

for( j*0  >  j  <  N_wards  *  J++)  . 
d_tableC03CJ3  »  0177777  I 

for(j*3  J  J  <  N_seon  *  J++) 
last_coe*utadCJ3  *  0  I 

ft  END  OF  INITIALIZATION  AND  BEGINNIG  OF  MAIN  LOOP  */ 

ft  Outer  for  loo*:  stade*l  ->  declarations* 
stase*2  ->  in*ut  *art* 
stase*3  ->  * roar a*  body* 
staae*4  ->  out*ut  *art.  %/ 
for(staae*l  i  stase  <»  4  ?  Hstase) 

< 

location  * 0  f 
no t_ done  »  1  * 

/t  inner  loo*  l:  read  the  tu*les  for  the  corres*ondina  stase 
fro*  file*  kee*  track  of  the  to*  of  the  stack  and  save  in 
*roaC3  only  those  tri*les  that  reauire  a  certain  action  %/ 
do 

< 

fscanf(  f*  *  ’Sc*  *  taction  )  ) 
if (action  !»  'L')  fscanf(  f*  *  */W\n*  *  tvalue  )  J 
else  <  check(7*  ++r.*tr)  * 

fscanf (  f*  *  *%f\n'  *  tr_storeCr.*tr3  )  )  > 
switch  (action) 

< 

case  'R'  5  <  lookina  »ll  j»0l  /*  REDUCE  */ 

while< lookina  it  J  <  N,duas) 

if(value  *■  duesCJ++3)  lookina  *  0  I 
if(lookina)  *ush_tri*le( action*  value*  to*)  ) 
checkd*  (to*  adJustC value-1 3)  ) * 
if (value  >a  end.staaeCstaae3)  not_done«0  ! 
break  > 


ft  REDUCE  %/ 


<  checkd*  ++to*)  ) 

*ush_tri*le( action*  value*  to*  )) 
break  I 


/*  SHIFT  IDENTIFIER  tf 
/ *  OR  INTEGER  CONSTANT  tf 


•*.  -r. 


'V* 


/*  SHIFT  REAL  CONSTANT  */ 


case  'L' 


cis<  S' 


case  'A' 


> 


:  <  checkti*mor>  i 
rush.trirleUction* 
break  I 

> 

*  <  check(l*++tor)  * 
break  J 

\ 


:  <  check(l*  Mtop) 
rush.trirleiactxon* 
braak  * 


> 

while (not.dona)  * 


r_ptr*  top)  * 


value*  top 


* 

)* 


/*  SHIFT  %/ 
/%  ACCEPT  */ 


location  »  -1  i 
not.dona  *  1  * 

ft  innap  loop  21  exacute  the  action  routines  f or  that  stafe  %/ 
while (not. dona) 

< 

^location  i 

value  *  rrofClocation!. value  * 
top  *  rrofClocation!, tor  * 
switch(  prof [location!, action  ) 

< 

case  ' C '  J  <  stackCtor!  *  value  *  break  *  > 

case  'l'  1  <  fstackCtor!  »  r.stareCvalue!  )  break  *  > 

case  'R'  1  <  semantic ( value*  top)  >  braak  *  > 

case  'A'  t  not.dona  *  0  I 

> 

> 

> 

fclose(fp)  ) 

> 

/%  END  OF  THE  MAIN  PROGRAM  %/ 


/% 

A  routine  to  store  a  triple  in  the  array  rrofC! 
*/ 

rush-trirleia  *  v*  t)  char  a  *  int  v*t  > 

< 

rrofClocation!. action  »  a  * 
prof Clocation!. value  *  v  ) 
prof (location!. top  *  t  ( 
check<0*  Hlocation)  ) 

l•eUlm<0)  ) 

> 
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,XXXXXtXXXXX-XXXtXXXXXX2XXXXXX-*XTTtXXXttXXXXtXXXXXXXXXXXXXXXXXXXXXtXXXXXXXXX/ 

ft  THE  SEMANTICS  ROUTINES  1/ 

/■xttttttttttxttttxxxxxxtttttxttxtttttttttxtxttxtxxxtxtxttxxxxtttxxxtxtxxtxtf 


int  declarins  •  1  > 
int  ski?  »  0  } 
int  Mski?»  0  i 
float  setfloat*)  ♦ 
float  tfloat  i 
int  TIME  *  1  ; 
int  d_flas  > 


ft  *1  only  durina  declaration  */ 

ft  to  ski?  calculation  in  case  of  don't  cares  1/ 

ft  to  chose  the  argument  in  M  operator  %/ 

ft  teerorary  variable  tf 

ft  alobal  systee  tine  tf 


semantic (rule*  to?)  int  rule* tor  * 

\ 

int  tOi  t2i  resdins  i  J  • 

if (Mski?) 

<  switch* rule) 

<  case  54  :  break  S 

case  54  :  — stackCto?-43  5 

case  45  t  —Mski?  i 

default  5  return* 0)  ) 

> 


if (ski?) 

<  switch* rule) 
<  case  53  J 
case  54  : 
case  57  t 
case  58  : 
case  57  : 
case  40  *. 
case  41  ' 


<  —ski?  i  return* 0)  i  > 


<  ski?++  i  return(O)  i  > 


default  t  return(O)  i 


switch* rule) 

< 

case  1  5  <  not_done  *  0  i  return* 0)  i  > 

case  2  :  <  declarins*  0  f 

not-done  *  0  i 

sea. size  *  sea_?tr  -  1  »  reti 

't  PARAMETER  DECLARATION  */ 


*0)  »  >  ft  sisnal  end  of  stase  4  tf 

ft  sisnal  end  of  declarations  tf 
ft  that  is  stase  1  tf 

1  i  return(O)  »  > 


case  7  i  <  t2  *  stackCtor-23  i  check* 2it2)  > 
if < (t2  >  2)  II  (sy«i_tabCt23.ty?e  !* 
sy»-tabCt23.entryl  *  stackCto?3  i 

sye-tabtt23. twee  *  'P'  i 

return<0)  )  3- 

t  INDEX  DECLARATIONS  tf 

case  10 1 

case  11 :  <  to  *  stackCto?3  i  check(2it0)  » 

if* (tO  >  2)  II  (sy»-tabCt03.ty?e  !> 
sy«»_tabCt03.entryl  *  0  ) 

sy«_tabCt03.entry2  *  0  i 

sye_tabCt03.ty?e  *  'I'  i 

return<0)  t  > 

t  SEQUENCE  DECLARATIONS  tf 

case  13?  <  check* 3 1  k+ran-rtr)  ) 
lboundCran-?tr3  ■  0  ) 
uboundCran.?tr3  ■  0  ) 
see.?tr  »  seo_?tr  +  stack [to?- 13  i 
check<4»  sea-?tr)  ) 
return(O)  >  > 

case  14*.  <  tO  >  stackCto?3  I 

if* (tO  >  2)  It  <sy»_tabCt03.ty?e  !* 
sye-tabCt03.ty?e  ■  '3'  ) 

sye-tebCt03. entry 1  «  seo-rtr  ) 

sy»-tabCt03. entry*  ■  -1  I 

check*4»++s#o-?tr)  > 

return(O)  >  > 


’)>  run_error*15)  i 


))  run-error* 13)  i 


))  run-error* 15)  ) 


C3se  57: 


s  stack! top-l]  • 

if(TiME  <  t2)  (.3kiP8l  I 

stack! top-5]  =  1  ; 
fstack!top-53  *  0  i  return(O) 
t2  8  (TIME  -  t2)  7.  tO  i 
t or(J=0  t  t2  >8  ratio_temp!J3  »  j++)  s 
stack!top-53  8  rstio.Ptr  J 
Mskip  * 


return(O)  »  > 


/*  cardinality  of  list* 
/*  chosen  element  * 


8  1 


case  58:  <  tO  8  stack! top]  i 

if (TIME  <=  t0>  <  skiP=l  ; 

stack! top-1]  .  . 
fstackCtop-t]  8  0  5  return(O) 
stackCtop-l]  8  TIME  i 
TIME  8  TIME  -  tO  5 

return(O)  i  > 

case  59:  <  to  8  stackCtop]  » 

if (TIME  <»  tO)  <  skip  8  1  ) 
stack! top-1] 

,  ,  f stack! top-1]  8 

stackCtop-l]  8  TIME  5 


0 

0 


return(O) 


TIME  8  TIME  -tO 
return<0)  »  > 

case  60:  <  tO  8  stack! top]  J 

t2  8  (TIME  +  tO)  X  (1  +  tO)  I 
if ( t2)  <  skip  »i  i 

stack!top-l]  =  l 
fstackCtop-1]  8  0 
t2  8  (TIME  +  tO)  /  (1  +  tO) 
stack! top-1]  8  TIME 
TIME  8  t2 
return (0)  »  > 

tO  8  3tack!top-l]  i  t2  8  stackItop-3D  » 


/*  don't  care  */ 


return(O)  i> 


case  61 J 


if (TIME 


case  62: 
case  63! 
case  64: 


<  skip  8  1  t 
stack! top-5]  8  1  * 

.  in  fstack!top-3]  8  0  ;  return(O)  i  > 

stack! top-5]  8  TIME  } 

TIME  8  TIME  -  ((TIME  -  t2>  X  tO)  J 
return(O)  5  > 

<  stack!top-23  8  stack!top-13  i  return(O)  $  > 

<  stack !tor+l]  8  1  ;  return(O)  i  > 

■C  —stack!  top-4]  » 

stack! top-23  8  stack!top]  J 
f3tack! top-23  8  fstack!top3  » 

•C-  —  Mskip  5 
retijrn(O)  »  } 
i  check <6»  ++ratio_ptr)  ? 
stack!top-23  t8  stack!top3  » 
ratio. temp! ratio-Ptr]  8  stack!top-23  f 
return(O)  »  > 

case  67:  <  ratio-Ptr  8  0  5 

ratio. temp!03  8  stack!top]  i 
return(O)  »  > 


case  63: 
case  66: 


E  INPUT  SPECIFICATIONS  1/ 

case  68:  !  not.done  8  0  5  /*  end  of  stase  2  */ 

return(O)  »  > 

case  70:  {  stackll]  8  stackltop]-  i 
check(5>  stack! top 3)  i 
return(O)  f  > 

cast  71 :  <  tO  =  stackttop]  »  readins  ■  l  » 
for(J8l  ?  j  <8  stack!13  )  j++) 
if(readins) 

■C  seo.store!t03!J3  8  setfloat(d_flas>  i 
if(d.flas  88  1)  urite.d(tO» j)  } 
if(d.flas  «  -l)  <  readins  8  0  » 
y  write_d(tO> j)  J  > 

else  write.d(tO»J)  f 
last.coeputedltO]  8  stack!13  I 
return(O)  »  > 

case  72*.  <  tO  8  stack!toe-l]  t 

if(ss»rtablt03.entryl  >8  s*».tab!t03.entrw2>  return(O)  i 
++ssm-tab!t03.entrsl  i 
location  8  stackItop-23  i 
return(O)  *  > 


0339  ' w « 


t2  3  st3ckCtop-4]  5  check (2*t2)  i 
if (sy»_tabCt23 .type  !=  'I')  run.error(6)  * 
sye_tabCt2] .entryl  3  stack! top-2]  > 
syi>-.tabCt21  .entry2  =  stackCtop]  > 
st3ckCtop-53  3  locstion  » 
return(O)  )  > 


/*  OUTPUT  SPECIFICATIONS  */ 


C3»e  77:  <  tO  3  stackCtop]  • 

for<j*i  ;  j  o  stsckci]  ;  j++) 
if ( read_d(tO»J) )  printfC  d  * )  * 
else  printf < *"5.2f  * >  seo-storeCtOTUl)  * 
printf < *\n  XXXXXXXXXXXXXXXXXXXXXXX\ n*)  5 
return(O)  f  > 

esse  73:  <  tO  3  stackCtop-ll  » 

if (sye-tabCtOl. entryl  >=  sy*-tabCt03 .entry2)  return(O) 
k+sya_tabCtO] .entry!  5 
location  3  stack! top-23  » 
ptturn(O)  !  > 

> 

1 

/%  END  OF  THE  SEMANTICS  ROUTINES  X/ 


/xxtxxxxxxtxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx/ 

/X  User  Defined  Operators  X/ 

/xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx/ 

ft 

These  routines  are  provided  by  the  user  to  define  the 
binary  operators  U1  and  U2.  The  operands  are  passed  in 
ol  and  o2  and  the  result  is  returned  in  r. 

If  any  of  the  operands  is  the  don't  care  svebol*  tl  or  t2 
is  set  to  1*  correspondingly*  otherwise  they  are  set  to  0. 
The  return  value  of  the  functions  should  be  0  if  the 
result  of  the  operation  is  not  a  don't  care  and  1  if  it  is. 
%/ 

u_orl(ol*tl>o2>t2>r)  int  tl»t2  > 

float  ol*o2*  *r  I 

< 

/*  roreulas  for  u.opI  and  r  </ 


u_op2<o1* tl*o2* t2* r)  int  tl*t2  i 

float  ol#o2*  *r  > 

< 

/%  Formulas  for  u-op2  and  r  X/ 

/ 


/xxxxxxxxxtxxxxxtxxxxxxxxtxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx/ 
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/x**xx*mmx*xm****xmx*xx*x*xxxm*x***#*x#xx**m*x*#*x#xxx**/ 

/* 

The  following  routine  reads  the  next  item  from  the 
input  data  file.  It  assumes  that  one  of  the  following 
exists  on  the  file! 

15  3  floating  point  number* 

25  3  'd'  »  indicating  3  don't  care  item* 

3)  or  5  indicating  that  the  remaining  items  in 

the  current  serouence  sre  don't  cares. 

The  different  C3ses  3re  signaled  by  tine  global  flag  d-flsg 
%/ 

float  getflostO 

< 

int  Cf  int_?3rt  5 
int  minus  =15 
float  fraction  »  p_of_10  5 

while< (c=getchar() )  ==  '  '  1 1  c  ==  '\n'  )  5 
if(c  ==  EOF  )  run.error(9)  5 

Lf(e  ==  'd'  )  -C  d-flag  =  15  /%  a  don't  care  symbol  %/ 

return<0)  5> 

if (c  ==  \l  (getcharO  ==  '.')  II  (getcharO  ==  '.')  ) 

•C  d-flag  =  -1  5  /*  seouence  terminated  1/ 

returm'O)  5> 

if(c  ==  '-')  <  c=aetchar<)  5  minus  =  -1  5> 
int-part  =  0  5 

while(isdisit(c)5  <  int_?art  =  (10  *  int-part)  +  (c  -  '0')  5 

c  =  getcharO  5  > 
if<  c  !*  run_errror<3)  5 
fraction  =  0.0  5 
p-of-10  =  10.0  5 

while(isdigit(c=getchar() ))  <fraction  =  fraction  *  <c  -  '0')  /  P-of_105 

P-of-10  =  P.of-10  *  10.0  5  > 

if((c  ! =  '  ')  St  (c  ! =  '\n'l  )  run_error(8)  5 
Traction  =  minus  *  (  int-Part  +  fraction  )  » 
d-flag  =  0  5 
return( fraction)  5 


isdigit(d)  int  d  5 

if(d  <■  '9'  12  d  >=  '0')  return(l) 5 
return(O)  5 

/ttx*t**t%t*x**%**t***%t***tx$*t%*t*%*t*t*******zttt*zt**ttzt***tt**z/ 

/% 

The  following  routines  keep  track  of  the  position  of  the 
don't  care  symbols  in  the  data  seautncesJ  Each  entry  in 
a  seouence  has  a  corresponding  bit  in  the  array  d-table> 
write_d(s»t>  sets  the  bit  corresponding  to  the  element  t 
of  the  seouence  s  to  1  indicating  a  don't  care  t  while 
read_d(s»t)  returns  the  value  of  the  bit  corresponding 
to  the  element  t  in  seouence  s. 

*/ 

urite-d(S’t)  int  s>t  5 

< 

int  word»  bit  5 

unsigned  pattern  »  1  5 

word  *  t  /  16  5 

bit  =  (t  16)  5 

pattern  =  pattern  «  < 15-bit)  5 

d-tableCslCwordl  =  (d-tableCslCwordl)  1  pattern  5 

return (0)  5 

> 

read_d(s»t)  int  s>t  5 

f 

int  word»  bit  5 
unsigned  pattern  *  1  5 
word  «  t  /  16  5 
bit  »  t  7.  16  5 

pattern  *  pattern  «  (15-bit)  5 
pattern  >  pattern  2  d-tableCsKwordl  5 
if (pattern)  return(l)  5 
return(0)  I 

> 
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/WI  ERROR  ROUTINES  ****** / 


/* 

A  routine  to  check  the  bounds  of  working  arrays*  the  array 
to  be  checked  is  determined  by  the  argument  i. 

*/ 

check* i»  ptr)  int  i/Pir  ? 

p 

\ 

if(ptr  <  overflowCil)  return* 1)  » 
switch* i) 


case  0  I 
case  1  *. 
case  2  : 
case  3  : 
case  4  t 
case  5  *. 
case  6  J 
case  7  i 

> 

exit**))  ) 

v 


■C  printf*'***  program  array  overflow  ***\n')  * 
exit*0)  }  > 

<  printf* '£**  working  stack  overflow  ***\n * )  5 
exit**))  *  > 

•C  printf*'***  symbol  table  overflow  <**\n')  t 
exit(O)  i  > 

{  printf* '*£*  bound  table  overflow  ***\n')  J 
exit(0)  i  > 

<  printf*'***  seauence  store  overflow  **\n‘)  • 
exit(O)  J  > 

<  printf* '***  MAXT  should  be  less  than  2d\n'*  Maxtime)* 
exit**))  *  > 

<  printf*'***  temporary  ratio  list  overflow  **\n')> 
exit(O)  >  > 

<  printf*'***  real  constants  storage  overflow  **\n')) 
exit(O)  f  > 


/:m*******m*********************x*******************/ 

/* 

A  procedure  to  print  run  time  error  messages  and  stop 
execution.  The  message  to  be  printed  is  determined 
by  the  argument  i, 

*/ 


run_error*i)  int  i  * 

{ 

switch* i)  - 
< 


esse 

i  : 

< 

case 

2  : 

{ 

case 

3  : 

{ 

case 

4  : 

< 

case 

s  : 

< 

case 

6  ! 

< 

case 

7  ! 

< 

case 

8  : 

< 

case 

?  : 

< 

case 

10: 

< 

case 

iu 

< 

case 

12: 

< 

case 

13: 

< 

case 

14} 

< 

case 

13} 

< 

printf*'**  seauence  array  out  of  bound  \n')> 
exit**))  *  > 

printf*'**  too  many  array  arguments  \n')  * 
exit<0)  *  > 

printf*'**  too  few  array  arguments  \n')  * 
exit(0)  *  > 

printf*'**  only  parameters  may  be  used  in  seau.  declaration  \n')> 
exit*0)  »  > 

printf ('**  expecting  a  par.  or  an  index  in  sea.  specification  \n')» 
exit(0)  !  > 

printf*'**  rOR  variables  must  be  declared  as  INDEX  ')  * 
exit(0)  *  > 

printf*'**  wrong  number  of  arguments  in  Multiplexer  list  \n')» 
exit<0)  5  > 

printf*'**  Format  error  in  input  file  **\n')  * 
exit(0)  *  > 

printf*'**  insufficient  data  in  input  file  **\n')» 
exit(0)  >  > 

printf*'**  incorrect  model  or  non  causal  eauations  X*\n')* 
exit*0)  >  > 

printf*'**  Inconsistent  system  of  eauations  \n')  * 
printf*'**  Attempt  to  overwrite  a  seauence  \n')  ! 

■ixit(O)  )  > 

printf*'**  array  of  seauences  not  declared  \n')  ) 
exit(0)  )  > 

printf*'**  seauence  not  declared  \n')  i 
axit(O)  i  y 

printf*'**  missing  argument  list  \n')  ) 
exit**))  «  > 

printf*'**  variable  already  declared  \n  ')  ) 
exit**))  )  > 


> 


> 


/*******  END  OF  PROGRAM  LISTING  ********/ 


LJ 
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